Auditory/visual speech in multimodal human interfaces

نویسندگان

  • Dominic W. Massaro
  • Michael M. Cohen
چکیده

Program in Experimental Psychology University of California Santa Cruz, CA 95064 ABSTRACT It has long been a hope, expectation, and prediction that speech would be the primary medium of communication between humans and machines. To date, this dream has not been realized. We predict that exploiting the multimodal nature of spoken language will facilitate the use of this medium. We begin our paper with a general framework for the analysis of speech recognition by humans and a theoretical model. We then present a system for auditory/visual speech synthesis that performs complete text-to-speech synthesis. This system should improve the quality as well as the attractiveness of speech as one of a machine’s primary output communication medium. Mirroring the value of multimodal speech synthesis, multimodal channels should also enhance speech recognition by machine.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Multimodal Metaphors for Note taking in E-learning

This research investigates the use of multimodal metaphors to communicate information in the interface of e-learning application in order to reduce the visual communication by incorporating auditory stimuli. Three experimental studies conducted to investigate the effect of using multimodal metaphors in e-learning applications. The first experiment introduced an empirical study to investigate th...

متن کامل

Eyebrow movement as a cue to prominence

INTRODUCTION Speech communication is inherently multimodal in nature. While the auditory modality often provides the phonetic information necessary to convey a linguistic message, the visual modality can qualify the auditory information providing segmental cues on place of articulation, prosodic information concerning prominence and phrasing and extralinguistic information such as signals for t...

متن کامل

Efficiency of Speech Recognition for Using Interface Design Environments by Novel Designers

Previous studies on usability of graphical design-widgets, like menus and buttons, proposed the use of speech and non-speech (earcons and auditory icons) for solving their usability problems. In this paper we investigate speech as an input metaphor to enhance learnability, or the ability to use a system with no prior knowledge, in order to design interfaces using a multimodal interface design t...

متن کامل

The role of expressive full - body avatars and earcons and auditory icons in e - assessment interfaces

This paper investigates the role and effectiveness of multimodal metaphors in e-assessment interfaces. It evaluates usability of specific combinations of multimodal metaphors on their own or in combination with other. The parameters of the evaluated usability included efficiency, effectiveness and user satisfaction. The empirical research described in this study consisted of three experiments o...

متن کامل

Modality-specific Affective Responses and their Implications for Affective BCI

Reliable applications of multimodal affective brain-computer interfaces (aBCI) require a detailed understanding of the processes involved in emotions. To explore the modality-specific nature of affective responses, we studied neurophysiological responses of 24 subjects during visual, auditory, and audiovisual affect stimulation and obtained their subjective ratings. Coherent with literature, we...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 1994